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Abstract. I describe a general formalism for galaxy biasing |S| and its application 
to measurements of (3 (= fl 6 /b), e.g., via direct comparisons of light and mass 
and via redshift distortions. The linear and deterministic relation g — bS between 
the density fluctuation fields of galaxies g and mass 5 is replaced by the conditional 
distribution P(g\S) of these as random fields, smoothed at a given scale and at a given 
time. The mean biasing and its non-linearity are characterized by the conditional 
mean (g\S) = b(S)6, and the local scatter by the conditional variance 0^(5). This 
scatter arises from hidden effects on galaxy formation and from shot noise. 

For applications involving second-order local moments, the biasing is defined by 
three natural parameters: the slope b of the regression of g on S (replacing 6) , a non- 
linearity parameter 6, and a scatter parameter a^. The ratio of variances b 2 ^ and 
the correlation coefficient r mix these parameters. The non-linearity and scatter lead 
to underestimates of order b 2 /b 2 and cr^/b 2 in the different estimators of /3, which 
may partly explain the range of estimates. 

Local stochasticity affects the redshift-distortion analysis only by limiting the 
useful range of scales. In this range, for linear stochastic biasing, the analysis reduces 
to Kaiser's formula for b (not 6 V ar) independent of the scatter. The distortion analysis 
is affected by non-linearity but in a weak way. 

Estimates of the nontrivial features of the biasing scheme are made based on sim- 
ulations |p3 and toy models, and a new method for measuring them via distribution 
functions is proposed |53|. 



1 Introduction 

The fact that galaxies of different types cluster differently |23|, [II], |I], ||, |3|, |7j 
implies that many of them are biased tracers of the underlying mass distribu- 
tion. Without such biasing, it is hard to reconcile the existence of large volumes 
void of galaxies |Q and the spiky distribution of galaxies on ~ 100/i -1 Mpc 
scales, today j^] and at high redshifts |5(],|55|, with the standard theory of grav- 
itational instability theory (GI). There is partial theoretical understanding of 
the origin of biasing ||, |l|, |, f|| fl[ |, @, ||, ||, supported by cosmological 
simulations which confirm the existence of biasing PI fl @, H and show that 



it becomes stronger at high redshifts ^ [34| ^||. ^4 1 . 

The biasing is interesting as a constraint on galaxy formation, but it is 
also of great importance when estimating the cosmological density parameter 
Q. If one assumes linear and deterministic biasing and applies the linear 
approximation for GI, V-w = — f(Q)S, where /(f2) — £l 6 [^9|, the observables 
g and V-t> are related via the degenerate combination (3 = f(ft)/b. Thus, 

1 to appear in Large-Scale Surveys (IAP Symposium XIV, Paris, France, May 1998), eds 
Y. Mellier & S. Colombi (Editions Frontieres). 



one cannot pretend to have determined O by measuring [3 without a detailed 
knowledge of the biasing scheme. 

It turns out that different methods lead to different estimates of (3 in the 
range 0.4 < (3 < 1.1 jl5|, [57], [18, 16 1. The methods include: (a) comparisons of 
local moments of g (from redshift surveys) and 5 (from peculiar velocities) or 
the corresponding power spectra or correlation functions; (b) linear regressions 
of the fields g and S, or the corresponding velocity fields; and (c) analyses of 
redshift distortions in redshift surveys. In order to sharpen our determination 
of f2, it is important that we understand this scatter in /?. Some of it is due 
to the different types of galaxies involved and some may be due to the effects 
of non-linear gravity and perhaps other sources of systematic errors. Here 
we investigate the possible contribution of nontrivial properties of the biasing 
scheme such as stochasticity and non-linearity. 

The theory of density peaks in a Gaussian random field |3f| [| predicts that 
the linear galaxy-galaxy and mass-mass correlation functions are related via 
£gg( r ) = b 2 £,mm{r), where the biasing parameter b is a constant independent 
of scale r. However, a much more demanding linear biasing model is often 
assumed, in which the local density fields are related deterministically via 
the relation g(x) = bd(x). This is not a viable model because (a) it has no 
theoretical motivation, (b) if b > 1 it must break down in deep voids because 
values of g below —1 are forbidden, and (c) conservation of galaxy number 
implies that the linear biasing relation is not preserved during fluctuation 
growth. Thus, non-linear biasing, where b varies with 5, is inevitable. Indeed, 
the theoretical analysis of the biasing of collapsed halos by Mo & White fl46| , 
using the extended Press-Schechter approximation predicts that the biasing 
is non-linear. It provides a useful approximation for its behavior as a function 
of scale, time and mass. iV-body simulations, which provide a more accurate 
description (see Figure 0; J54|), show that this model is indeed useful. 

Note that once the biasing is non-linear at one smoothing scale, the biasing 
at any other smoothing scale must obey a different functional form of b(5) and 
is non-deterministic. Thus, any deviation from the simplified linear biasing 
model must also involve scale-dependence and scatter. Another inevitable 
source of scatter is physical scatter in the efficiency of galaxy formation as a 
function of 5, because the mass density at a certain smoothing scale (larger 
than the scale of galaxies) cannot be the sole quantity determining galaxy 
formation. For example, the random variations in the density on smaller scales 
and the local geometry of the background structure must play a role too. 
These hidden parameters would show up as scatter in the density-density 
relation. A third obvious source of scatter is the shot noise. One can try to 
remove it a priori, but this is sometimes difficult because of the small-scale 
anti-correlations introduced by the finite extent of galaxies. The alternative 
is to treat the shot noise as an intrinsic part of the local stochasticity of the 
biasing relation. The scatter arising from all the above is clearly seen for halos 
in simulations including gravity alone (§ ^) even before the complex processes 
of gas dynamics, star formation and feedback affect the biasing. 
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2 Local Moments: Variances and Linear Regression 



Let 8(x) be the field of mass-density fluctuations and g(x) the corresponding 
field of galaxy-density fluctuations, at a given time and for a given type of 
object. The fields are both smoothed with a fixed window which defines the 
term "local" . The local biasing relation is considered to be a random process, 
specified by the biasing conditional distribution P(g\8). Let the one-point 
probability distribution functions (PDF) P(8) and P(g) be of zero means and 
standard deviations a 2 = (S 2 ) and a 2 = (g 2 ). 

Define the mean biasing function b(8) by the conditional mean, 

b(8)8=(g\8)= J dgP(g\8)g. (1) 

This function is plotted in Figure |l|. It is a natural generalization of the 
deterministic linear biasing relation, g = b±8. The function b(8) allows for any 
possible non-linear biasing. We find it useful to characterize the function b(8) 
by its moments b and b defined by 

b= (b(S)8 2 )/a 2 and b 2 = (b 2 (8) 8 2 )/<r 2 . (2) 

It will become clear that b is the natural extension of b\ and that bjb is the 
relevant measure of non-linearity, independent of stochasticity. 

The local statistical character of the biasing can be expressed by the con- 
ditional moments of higher order about the mean at a given <5. Define the 
random biasing field e by e = g — {g\8}, with (e\8) — 0. The local variance of 
e at a given 8 defines the biasing scatter function a\,(8) and by averaging over 
8 one obtains the local biasing scatter parameter. 

a 2 (8)^(e 2 \8)/a 2 , o 2 h = (<?}/ o 2 . (3) 

The scaling by a 2 is for convenience. The function (e 2 \8) x / 2 is marked by error 
bars in Figure |l[ Here and below we make use of a straightforward lemma, 
valid for any functions p(g) and q(8): 

(P(g)q(8)) = ((p(g)\8) gl5 q(8)) s . (4) 

From the three basic parameters defined above one can derive other biasing 
parameters. A common one is the ratio of variances, 

&L^/a 2 = 6 2 + ^. (5) 

The second equality is a result of Eq. It immediately shows that & var is 
sensitive both to non-linearity and to stochasticity, with 6 var > b. This makes 
6 var biased compared to b, 

b vaI = b (b 2 /b 2 + al/b 2 ) 1 ' 2 . (6) 
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Using Eq. (|), the mean parameter 6 is related to the covariance, 

bo 2 = (g6). (7) 

Thus, b is the slope of the linear regression of g on 5, which makes it a natural 
generalization of b\ . Unlike the variance a 2 in Eq. (0) , the covariance in Eq. (Q) 
has no contribution from Ob ■ A complementary parameter to 6 va r is the linear 
correlation coefficient, 

r ee (gS)/(a g a) = b/b val = (b 2 /b 2 + a 2 /b 2 ) ^ . (8) 
The "inverse" regression, of 5 on g, yields another biasing parameter: 

b inv ee a 2 J(gS) = 6 var /r = b (b 2 /b 2 + cj 2 /b 2 ) . (9) 

Thus, 6; nv is biased relative to b, even more than 6 var . The parameter 6j nv 
is close to what is measured in practice by two-dimensional linear regression 
[{52) , because the errors in S are larger than in g. Note that b and <7b nicely 
separate the non-linearity and stochasticity, while & var , r and fei nv mix them. 
In the case of linear stochastic biasing, the above parameters reduce to 

b = b = h, b val =h(l + o-l/bl) 1 ' 2 , r = h/bw, b inv = bt (1 + 0-l/b 2 ) . 

(10) 

Thus, b\ < 6 va r < binv I n the case of non-linear deterministic biasing: 

6^6, a b = 0, 6 var = 6, r = 6/6, 6 inv = 6 2 /6. (11) 

In the fully degenerate case of linear and deterministic biasing, all the 6 pa- 
rameters are the same, and only then r = 1. 

In actual applications, the above local biasing parameters are involved 
when the parameter "/?" is measured from observational data. For linear 
and deterministic biasing this parameter is defined unambiguously as 0i = 
/(fi)/6x, but any deviation from this model causes us to measure different 
[3's by the different methods. For example, it is /3 var = f{Q)/b val which is 
determined from o~ g and <r/(f2). The former is typically determined from a 
redshift survey, and the latter either from an analysis of peculiar velocity data, 
from the abundance of rich clusters, or by COBE normalization of a specific 
power-spectrum shape. In the case of stochastic biasing 6 var is always an 
overestimate of 6, Eq. (|^) , and when the biasing is linear 6 var is an overestimate 
of b\. Therefore /3 var is underestimated accordingly. 

Another useful way of estimate [3 is via the linear regression of the fields 
in our cosmological neighborhood, e.g., — V-w(a;) on g{x) |l7], |33|, (52) . In the 
mildly-non-linear regime, — V-v(ar) is actually replaced by another function of 
the first spatial derivatives of the velocity field, which better approximates the 
scaled mass- density field f(Ct)S(x) @. The regression is effectively 5 on g, 
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because the errors in V-W (or fS) are typically more than twice as large as the 
errors in g. Hence, the measured parameter is close to /3i nv = /(0)/&i n y In 
the case of linear and stochastic biasing, Eq. ([!(]) , b- mv is an overestimate of bi 
so the corresponding (3 is underestimated accordingly. 

3 Two-Point Correlations: Redshift Distortions 

For the analysis of redshift-distortion we need to deal with spatial correlations. 
Given the random biasing field e, we define the two-point biasing-matter cross- 
correlation function and the biasing auto-correlation function by 

= M 2 >, Cee(r) = <eie 2 ), (12) 

where the averaging is over the ensembles at points 1 and 2 separated by r. 
We define the biasing as local if £ em (0 = for any r and £ cc (r) = for r > r^, 
where r^ is on the order of the basic smoothing scale. Using lemmas that are 
two-point equivalents of Eq. (|4|), one obtains analogous relations to Eqs. (Q) 
and (^). In the case of linear and local biasing, these become 

£gm(r) = hUm(r), £gg(r) = 6?£ mm (r) + & e (r), (13) 

Note that the biasing parameter that appears here is b\, not & V ar- 

To see how the power spectra are affected by the biasing scatter, we ap- 
proximate the local biasing by a step function: £ ee (r) = c^cr 2 for r < n, and 
zero otherwise. Recalling that the power spectra are the Fourier transforms of 
the corresponding correlation functions, we get for k <C r^ 1 , from Eq. (|l"3|), 

Pgm(fc) = hP mm (k), P gg (fc) = 6 2 P mm (fc) + a 2 a 2 V b , (14) 

where V b is the volume associated with tv We see that the local biasing 
scatter adds a constant to P m (k). 

We can now proceed to estimating (3 via redshift distortions 
|25| , H| , p] , p4] , |4Q] . To first order, the local galaxy density fluctuations in 
redshift space (<7 S ) and real space (g) are related by g s — g — du/dr, where 
u is the radial component of the peculiar velocity v. Assuming no velocity 
biasing, linear GI predicts du/dr = — ^ 2 /(f2)(5, where ^ 2 is a geometrical 
factor depending on the angle between v and x. Thus, the basic linear relation 
for redshift distortions is g s = g + j /i 2 <5. The general expression for redshift 
distortions is obtained from this basic relation by averaging (gig?) over the 
distributions of 6 at a pair of points separated by r: 

= 4gM + 2(/M 2 ) Cgm(r) + (j> 2 ) 2 UW. (15) 

Recalling that the power spectra are the Fourier transforms of the correspond- 
ing correlation functions, one can equivalcntly write an expression involving 
-Pg g (fc), -Pgg(fc), -Pgm(fc) and -Pmm(fc), or the analogous spherical harmonics. 
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Next we tie in the biasing scheme. In the simplified case of linear and 
deterministic biasing, one simply has P gg = b\P gm = 6f P mm , so the distortion 
relation reduces to Kaiser's formula Q, P gg = P gg (l + [i 2 f3i) 2 , where /3i = 
f(Q)/bi. In the more realistic case of linear, local, and stochastic biasing, first 
at zero lag, £ ee (0) = a\a 2 and £ gg (0) = frjL-£mm(0)- Then, via Eq. (||) and 
Eq. (Q), the general distortion relation, Eq. (|l5|), reduces to 

al s = + 2(/ M 2 )rC + (f^)Xll (16) 

In this local equation both 6 var and r are involved in a non-trivial way; the 
distortions depend on the scatter, reflecting the a 2 term in Eq. (^). On the 
other hand, at large separations r > n,, where £ ee vanishes, one obtains instead, 
from Eq. @, 

e gg (r) = &g(r)[i + 2(/ M 2 ) + (V) 2 &r 2 ]- (17) 

This is simply the Kaiser formula again, which, unlike Eq. (|l6|), is indepen- 
dent of the biasing scatter! It involves only the mean biasing parameter 61, in 
an expression that is indistinguishable from the deterministic case. This is a 
straightforward result of the assumed locality of the biasing scheme: the bias- 
ing scatter at two distant points is uncorrelated and therefore its contribution 
to £ gg cancels out. 

The distortion relation for P(k) becomes more complicated because of the 
additive term in Eq. (|l4|). For linear biasing, when substituting Eq. ( |l4| ) in 
the linear distortion relation, the terms analogous to the ones involving b^ 1 
and b~[ 2 in Eq. ( |l7|) for £ are multiplied by [1 — a 2 a 2 Vb/ P gg (k)], a function 
of k. The distortion relation for P(k) is thus affected by the biasing scatter 
in a complicated way. However, there may be a significant k range around 
the peak of P(k) in which the additive scatter term is small compared to the 
rest. In this range the relation reduces to an expression similar to Eq. ( |l7| ) 
for the corresponding power spectra. Still, the scatter term always dominates 
Eq. ( |l4| ) at small and at large fc's. 

Equation (7) of Pen J50j] - which involves 6 var and r like our Eq. (|l6|), may 
leave the impression that the redshift-distortion expression depends on the 
scatter. In order to obtain his relation from the general distortion relation, 
one has to define fc-dependent biasing parameters by P gg (k) — b val (k) 2 P mm (fc) 
and P g m(fc) = b val (k)r(k)P mm (k). (Pen's (3 refers to his &i, which is our 
6 var , except that he allows it to vary with k). In the case of local biasing, 
a comparison to our Eq. (jl^) yields b v&r (k) 2 — b\ + a^a 2 Vb / P m m(k) and 
b val {k)r{k) = b\. In the k range near the peak of P m m(k) 7 where the constant 
term in Eq. ( |l4| ) may be negligible, one has 6 var (fc) = b\ and r(fc) = 1, and 
there is indeed no sign of the stochasticity in the distortion relation. 

While its sensitivity to stochasticity is indirect, the redshift distortion anal- 
ysis is sensitive to the non-linearity, of the biasing. A proper analysis would 
require a non-linear treatment including a non-linear generalization of the GI 
relation V v = — fS, because the non-linear effects of biasing and gravity enter 
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at the same order. The result is more complicated than Eq. (jlj), but is calcu- 
lable in principle once one knows the function b(S) and the one- and two-point 
probability distribution functions of 5. 

4 Biasing in Simulations and Toy Models 

In the scheme outlined above, the function b(S) contains the information about 
the mean biasing (via the parameter b) and its non- linear features (e.g., via 
b/b). The next quantity of interest in the case of stochastic biasing is the 
conditional standard deviation, the function ab(S), and its variance over 6, a^. 
In order to evaluate the actual effects of non-linear and stochastic biasing on 
the various measurements of /?, one should try to evaluate these functions or 
parameters from simulations, theoretical approximations and observations. 

In an ongoing study that generalizes earlier investigations JTo| , fl6| |, we 
are investigating the biasing in high-resolution TV-body simulations of several 
cosmological scenarios, both for galactic halos and for galaxies as identified 
using semi-analytic models j54j|. We refer here to a representative cosmo- 
logical model: Q = 1 with a rCDM power spectrum which roughly obeys 
the constraints from large-scale structure. The simulation mass resolution is 
2 x 1O 1O M inside a box of comoving side 85/i _1 Mpc. The present epoch is 
identified with erg = 0.6. Figure [l] demonstrates the qualitative features of 
the biasing scheme. The non-linear behavior at S < 1 is characteristic of all 
masses, times, and smoothing scales: b(6) -C 1 near 6 = — 1 and it steepens to 
b(S) > 1 towards 8 — 0. At 8 > 1 the behavior strongly depends on the mass, 
time and smoothing scale. The scatter in the figure includes both shot noise 
and physical scatter which are hard to separate properly. In the case shown 
at z = 0, the non-linear parameter is b 2 jb — 1.08, and the scatter parameter 
is <j\jb = 0.15. The effects of stochasticity and non-linearity in this specific 
case thus lead to moderate differences in the various measures of /3, on the 
order of 20 — 30%. Gas-dynamics and other non-gravitational processes may 
extend the range of estimates even further. 

Given the distribution P(S) of the matter fluctuations, the biasing function 
b(S) should obey by definition at least the following two constrains. First, g > 
— 1 everywhere, because the galaxy density p g cannot be negative, with g = — 1 
at 5 = — 1, because there are no galaxies where there is no matter. Second, 
(g) = because g describes fluctuations about the mean galaxy density. An 
example for a simple functional form that obeys the constraint at 5 — — 1 and 
reduces to the linear relation near 6 = is |ljj 

(g\S)=c(l + S) b -l. (18) 

The constraint (g) = is to be enforced by a specific choice of the factor c for 
a given b. With b > 1, this functional form indeed provides a reasonable fit 
to the simulated halo biasing relation in the 5 < regime. However, the same 
value of b does not necessarily fit the biasing relation in the <5 > regime. A 
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Figure 1: Biasing of galactic halos versus mass in a cosmological TV-body 
simulation, demonstrating non-linearity and stochasticity. The conditional 
mean [(g\S) = b(S)5] (solid curve) and scatter [(e 2 |<5) = a 2 (S)a 2 ] (error bars) 
are marked. The fields smoothed with a top-hat window of radius 8/i _1 Mpc 
are plotted at the points of a uniform grid. The halos are selected above a 
mass threshold of 2 x 1O 12 M . Left: at the time when erg = 0.6 (e.g., z = 0). 
Right: at an earlier time when as = 0.3 (e.g., z = 1). (Based on |p4f .) 



better approximation could thus be provided by a combination of two functions 
like Eq. (|l^) with two different parameters b n and b p in the regimes 6 < and 
5 > respectively. The parameter b n is always larger than unity while b„ 
ranges from slightly below unity to much above unity. The best fit to Fig. p] 
at z = has b n ~ 2 and b p ~ 1. At high redshift both b n and b p become 
significantly larger. 

The non-linear biasing function can alternatively be parameterized by 

Since g must average to zero, this general power series can be written as 

(g\6) = hS + b 2 (5 2 - <r 2 )/2 + b 3 (5 3 - S)/6 + (20) 

where a 2 = (<5 2 ), S = (S 3 ), etc. This determines the constant term bo. The 
constraint at —1 provides another relation between the parameters. Therefore, 
the expansion to third order contains only two free parameters out of four. 

In order to evaluate the parameters b and b for these non-linear toy models, 
we approximate the distribution P(S) as log-normal in p/p — 1 + 8 Jl2|, [39), 
where a is the single free parameter. With Eq. (|20|), Assuming b 2 b\ and 
(T<1, one obtains b 2 /b ~ 1 + (1/2) (&2/&1) cr 2 . This is always larger than 
unity, but the deviation is small. Alternatively, using the functional form 
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of Eq. (|l8|), with b n ranging from 1 to 5 and b p ranging from 0.5 to 3, and 
with a = 0.7, we find numerically that b/b is in the range 1.0 to 1.15. These 
two toy models, calibrated by the iV-body simulations, indicate that despite 
the obvious non-linearity, especially in the negative regime, the non-linear 
parameter b/b is typically only slightly larger than unity. This means that the 
effects of non-linear biasing on measurements of f3 are likely to be relatively 
small. 



5 Observational Constraints on Biasing 

Direct constraints on the biasing field should be provided by the data them- 
selves, of galaxy density (e.g., from redshift surveys) versus mass density (from 
peculiar velocity surveys, gravitational lensing, etc.). A hint of scatter in the 
biasing relation is the fact that the smoothed density peaks of the Great At- 
tractor (GA) and Perseus Pisces (PP) are of comparable height in the mass 
distribution as recovered by POTENT from observed velocities jl5[ |l3], [l9| , 



while PP is higher than GA in the galaxy maps [|33| 52 1. Another piece of 
indirect evidence for scatter comes from a linear regression of the 1200 km s -1 - 
smoothed density fields of POTENT mass and optical galaxies in our cosmolog- 
ical neighborhood, which yields a x 2 ~ 2 per degree of freedom 1 33 . One way 



to obtain a more reasonable y 2 ~ 1 is to assume a biasing scatter of <7b ~ 0.5 
(while a ~ 0.3 at that smoothing). With b\ ~ 1, one has a\/b\ ~ 0.25. This 
is only a crude estimate; there is yet much to be done with future data along 
the lines of reconstructing the "biasing field" in a given region of space. 

We have recently worked out a promising way to recover the mean biasing 
function b(S) and its associated parameters b and b from a measured PDF of the 
galaxy distribution [ p3[ . This method is inspired by a "de-biasing" technique 
by Narayanan & Weinberg p4fl . If the biasing relation g(S) were deterministic 
and monotonic, then it could be derived directly from the cumulative PDFs 
of galaxies and mass, C s (g) and C(8), via 

g(S)=C- 1 [C(S)}, (21) 

We find, using halos in TV-body simulations, that this is a good approximation 
for (g\S) despite the significant scatter about it. This is demonstrated in 
Figure |^. 

The other key point is that the cumulative PDF of mass density is rela- 
tively insensitive to the cosmological model or the power spectrum of density 
fluctuations |, §. We find using a series of -/V-body simulations of the 
CDM family of models in a flat or an open universe with and without a tilt 
in the power spectrum, that, compared to the differences between C g and C, 
the latter can always be properly approximated by a cumulative log-normal 
distribution of 1 + 6 with a single parameter a. Deviations may show up in 
the extreme tails of the distribution [9, which may affect the skewness and 
higher moments but are of little concern for our purpose here. This means 
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Figure 2: The PDFs and the mean biasing function, from a cosmological 
iV-body simulation of as = 0.3 (z = 1) with top-hat smoothing of 8/i _1 Mpc 
and for halos of M > 2 x 1O 12 M . Left: the cumulative probability distri- 
butions Cof density fluctuations of halos (<?) and of mass (S). A log- normal 
distribution is shown for comparison. The errors are by bootstrap re-sampling 
of halos. The horizontal separation between the curves approximates the mean 
biasing function (g\6) = b(5)a 2 at the corresponding value of 6. Right: the 
density fields of halos and mass compared at grid points. The symbols de- 
scribe the mean biasing function as derived from this data in bins. The solid 
curve is derived from the PDFs via Eq. (pl|). The dotted lines mark the error 
corresponding to the error in the PDF. (Based on |||.) 

that in order to evaluate b(S) one only needs to measure C g (g) from a galaxy 
density field, and add the rms a of mass fluctuations at the same smoothing 
scale. Since the redshift surveys are by far richer and more extended than 
peculiar-velocity samples, this method will allow a much better handle on b(S) 
than the local comparison of density fields of galaxies and mass. 

6 Conclusions 

The key feature in our biasing formalism is the natural separation between 
non-linear and stochastic effects. The non-linearity is expressed by the condi- 
tional mean via b(6), and the statistical scatter is measured by the conditional 
standard deviation, <Jb{8), and higher moments if necessary. For analyses us- 
ing local moments of second order, the biasing scheme is characterized by 
three parameters: b measuring the mean biasing, b/b measuring the effect of 
non-linearity, and o\,jb measuring the effect of stochasticity. 

Deviations from linear and deterministic biasing typically result in biased 
estimates of f3, which depend on the actual method of measurement. The 
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non-linearity and the scatter lead to underestimates of order b 2 /b and o-\jb 2 
respectively in the different estimators of (3 relative to $ — f(fl)/b. Based 
on iV-body simulations and toy models, the effects of non-linear biasing are 
typically on the order of 20% or less, and the effects of scatter could be larger. 
One expects the f3 parameters to be biased in the following order: j3- mv < 

/3var < fr 

The stochasticity affects the linear redshift-distortion analysis only by lim- 
iting the useful range of scales. In this range, the basic expression reduces to 
the simple Kaiser formula for b(S) = b = b\ (not 6 V ar), and it does not involve 
the scatter at all. The distortion analysis is in principle sensitive to the non- 
linear properties of biasing, but they are expected to be weak, and of the same 
order as the effects of non-linear GI. This is good news for the prospects of 
measuring an unbiased j3 from redshift distortions in the large redshift surveys 
of the near future (2dF and SDSS). 

The study of stochastic and non-linear biasing should be extended to ad- 
dress the time evolution of biasing because many relevant measurements of 
galaxy clustering are now being done at high redshifts. As seen in Fig. nL the 
biasing is clearly a strong function of cosmological epoch Ell [2(| |5q, [55|, 0, 
|]| In particular, if galaxy formation is limited to a given epoch 

and the biasing is linear, one can show |p6f that the linear biasing factor b\ 
would eventually approach unity as a simple result of the continuity equation. 
Tegmark & Peebles Q have recently generalized the analytic study of time 
evolution to the case of stochastic but still linear biasing and showed how 6 V ar 
and r approach unity in this case. These studies should be extended to the 
general non-linear case using our formalism. Our current simulations [ |54| are 
aimed at this goal. The analysis of simulations could also be extended to 
include non-local biasing, using the biasing correlations as defined here. 

The PDF (or count in cells) of galaxy density from a large-scale redshift 
survey, plus an estimate of a of the corresponding mass density, allow a mea- 
sure of the mean biasing function b(S) and the corresponding non-linearity 
parameter b/b. This can be done at low or high redshifts. Mapping of the 
biasing field in our cosmological neighborhood, and estimates of the biasing 
scatter, are feasible with current and future measurements of peculiar veloc- 
ities and careful comparisons to the galaxy distribution. The reconstruction 
of the large-scale mass distribution based on weak gravitational lensing is also 
becoming promising for this purpose. 

In summary, in order to use the measurements of (3 for an accurate evalu- 
ation of O, one should consider the effects of non-linear and stochastic biasing 
and the associated complications of scale dependence, time dependence, and 
type dependence. The current different estimates are expected to span a range 
of ~ 30% in j3 due to stochastic and non-linear biasing. The analysis of redshift 
distortions seems to be most promising; once it is limited to the appropriate 
range of scales, the analysis is independent of stochasticity and the non-linear 
effects are expected to be relatively small. The mean biasing function can 
be extracted from the galaxy PDF, and the scatter from theory and local 
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cosmography. 
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